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Abstract: We compute the expected value of the cosmological constant in our uni- 
verse from the Causal Entropic Principle. Since observers must obey the laws of thermo- 
dynamics and causality, the principle asserts that physical parameters are most likely to 
be found in the range of values for which the total entropy production within a causally 
connected region is maximized. Despite the absence of more explicit anthropic crite- 
ria, the resulting probability distribution turns out to be in excellent agreement with 
observation. In particular, we find that dust heated by stars dominates the entropy pro- 
duction, demonstrating the remarkable power of this thermodynamic selection criterion. 
The alternative approach — weighting by the number of "observers per baryon" — is less 
well-defined, requires problematic assumptions about the nature of observers, and yet 
prefers values larger than present experimental bounds. 
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1. Introduction 

The discovery that the universe is in a period of accelerated expansion [1,2], combined 
with an accurate accounting of the total, matter, and radiation components of the en- 
ergy density [3], provide overwhelming evidence for dark energy. These measurements 
are completely consistent with the interpretation of dark energy as a non-zero cosmo- 
logical constant, A. This has undermined the hope that the energy of our vacuum is 
uniquely determined by fundamental theory. Instead, it lends credence to the hypothe- 
sis that the cosmological constant is an environmental variable, which takes on different 
values in widely separated regions of the universe. 
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The observed vacuum energy density^ 

PA = ^ = (1.25 ±0.25) X 10-^23 ^ 

OTT 

is at least 55 orders of magnitude smaller than what would be expected from the stan- 
dard model of particle physics (see, e.g., Ref. [5] for a recent review). The environmental 
approach does not assert that this tiny value is inevitable, or even typical among all 
possible values. Rather, it aims to show that it is not atypical among values measured 
by observers. 

A number of conditions must be satisfied for the environmental approach to work. 
The first, obviously, is that fundamental theory must admit the observed value of the 
vacuum energy. This can happen without explicit tuning if the theory gives rise to an 
enormous number of different vacua. Of course, typical values will be of order unity, 
but if they are randomly distributed, they can form a dense spectrum, or "discretuum" , 
with average spacing of order 1/A^. If A^ ^ 10^^^, then it is likely that the observed 
value is included in the spectrum. Thus, the approach really depends on whether 
fundamental theory (which, presumably, is more or less unique) admits a sufficiently 
dense discretuum. 

The second condition is that the observed value must be dynamically attainable, 
starting from generic initial conditions. With A^ ^ 10^^^ possibilities, there is no 
reason for the universe to start out in a vacuum like ours. The environmental approach 
therefore depends on a means to start from some generic initial value and later realize 
the observed value, either as a branch in the wavefunction or as a particular spacetime 
region embedded in a vast universe. 

Finally, the environmental approach requires an explanation of why we happen to 
observe such an unusually small vacuum energy. Most values of p\ in the discretuum 
will be of order unity, and only a fraction (in the simplest fraction of order 

]^g-i23^ will have a magnitude as small as the observed value. It is not enough to show 
that the small value given by Eq. (|1 . 1| ) is possible; one must also show that it is not 
unlikely to be observed. 

The first condition appears to be satisfied by string theory, which admits as many 
as 10^°° long-lived metastable vacua [6-10] (see Ref. [11] for a discussion of earlier 
work). The second condition can then be met because the vacua are metastable and 
can decay into one another.^ 

^Unless indicated otherwise, all observed values in this paper are taken from Ref. [4]. Where no 
explicit units are given, we set h — Gn — c — — 1- 

^In general, long-lived metastability implies that all matter is diluted before the next decay occurs, 
so the mechanism depends on efficient reheating in the new region. This rules out models that reduce 
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Figure 1: The probability distribution of the vacuum energy measured by typical observers, 
computed from the Causal Entropic Principle, is shown as a solid curve. The values consistent 
with present cosmological data, in the shaded vertical bar, are well inside the la region (shown 
in white), and hence, not atypical. For comparison, the dashed line shows the distribution 
derived by estimating the number of observers per baryon. Unlike our curve, it assumes 
that galaxies are necessary for observers; yet, the observed value is very unlikely under this 
distribution. For more details about both curves, see Figures 2 and 8. 



In this paper we address the third condition. We will use a novel approach, the 
Causal Entropic Principle, to argue that the observed value of p\ is not unlikely. Our 
main result is shown in Fig. 

The Causal Entropic Principle is based on two ideas: any act of observation in- 
creases the entropy, and spacetime regions that are causally inaccessible should be 
disregarded. It assumes that on average, the number of observations will be propor- 
tional to the amount of matter entropy produced in a causally connected region, AS*. 
Vacua should be weighted by this factor to account for the rate at which they will be 

the cosmological constant gradually [12,13]. In the string landscape, the vacuum preceding ours was 
likely to have had an enormous cosmological constant. Its decay acted like a big bang for the observed 
universe and allowed for efficient reheating [6]. 
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observed. 

Crucially, the size of the causal diamond is inversely proportional to the vacuum 
energy, so smaller values of pa allow for greater complexity. This compensates for the 
scarcity of vacua with small p\. As a result, p\ prefers to take a value such that vacuum 
energy begins to dominate near the time of peak entropy production. 

We will find that entropy production in the causal diamond is dominated by dust 
particles heated by stars. This is an important result in its own right: our weight, AS, 
is a simple physical quantity that turns out to be sensitive to the existence of galaxies, 
stars, and heavy elements. 

We will show that the entropy production rate peaked approximately 2 to 3.5 billion 
years after the big bang. It is this time-scale, rather than the time of galaxy formation, 
which governs our prediction of the cosmological constant, and it prefers a range of p\ 
that is in very good agreement with observation. 

The same result also explains the so-called coincidence problem or "why now" 
problem. According to the Causal Entropic Principle, typical observers will exist when 
most of the entropy production in the causal diamond occurs. Our result ensures 
that this happens during the era when the matter and vacuum energy densities are 
comparable. 

Outline Historically, discussions of the third condition — why do we observe an un- 
usually small Pa — have focussed on anthropic selection effects, which we discuss in 
Sec. ^ Long before the discovery of the string landscape, it was noted that not all 
values of pA are compatible with the existence of observers [14-19]. This culminated 
in Weinberg's successful prediction [20] that a small non-zero value of pA would be 
observed, which we review in Sec. p.l| . Weinberg's assumption was relatively modest: 
Observers require galaxies. But astronomers have since discovered galaxies that would 
have formed even if pA had been more than a thousand times larger than the observed 
value. This leaves a large discrepancy between theory and observation. 

A possible resolution is to ask not only about the existence of observers, but to 
weight vacua by the number of observers they contain. This number is generically 
infinite or zero, so a regularization scheme is needed. A popular approach is to weight 
by the number of observers per baryon. In Sec. |2.2| , we argue that this approach is both 
poorly motivated and, in a realistic landscape, poorly defined. Moreover, it does not 
resolve the conflict with observation. To mitigate the discrepancy, one is forced to posit 
increasingly specific conditions for life, such as the chemical elements required. Indeed, 
to do reasonably well, one must suppose that observers can only arise in galaxies as 
large as ours — a very strong assumption, for which there appears to be no evidence. 
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In Sec. p.3| , we motivate and discuss a different approach to this problem. The 
Causal Entropic Principle weights each vacuum by the amount of entropy, AS, pro- 
duced in a causally connected region [21]. This is the largest spacetime region that 
can be probed and across which matter can interact. Since observation requires free 
energy, it is natural to expect that the number of observers will scale, on average, with 
AS*. In other words, we demand nothing more than that observers obey the laws of 
thermodynamics. This is far weaker even than Weinberg's criterion, and one might 
be concerned that it will not be sufficiently restrictive. Yet, in combination with the 
causal cutoff, this minimal requirement becomes a powerful predictive tool. 

This is demonstrated in Sec. |], where we use the Causal Entropic Principle to 
derive the probability distribution for over universes otherwise identical to ours. 
We begin, in Sec. PTT| , by computing the geometry of the causal diamond for general 
Pa- We are mainly interested in its comoving volume as a function of time, Vc{t). In 



Sec. |3.2| , we consider important mechanisms by which entropy is produced within our 
causal diamond. We estimate contributions from stars, quasars, supernovae, and other 
processes. We find that the leading contribution to AS comes from infrared photons 
emitted by interstellar dust heated by stars. 

In Sec. we analyze this leading contribution in more detail. We compute 
dS/{dVcdt), the rate at which entropy is produced per unit comoving volume per unit 
time. This rate will depend on p\ because large pA disrupts galaxy formation and 
thus star formation. Interestingly, however, it turns out that this dependence is not 



important for our final result. In Sec. 3.4, we integrate the rate found in Sec. |3.3| over 



the causal diamond determined in Sec. p7l\ This yields the weight factor, AS{pa). We 



display the resulting probability distribution for pA, and we note that the observed 
vacuum energy lies in the most favored range. 

In Sec. ^ we pinpoint the origins and discuss some implications of our main findings. 
We also identify important intermediate results. 

Extensions In the interest of time and clarity, we have limited our task. We use the 
Causal Entropic Principle solely to compute a probability distribution over positive 
values of the vacuum energy, holding all other physical parameters fixed. This is the 
case most frequently studied in the literature, making it straightforward to compare 
our result with those obtained from the traditional method of weighting by observers- 
per-baryon [22-25]. 

In other words, our distribution is conditioned on the assumptions that pA > 0, 
and that all low energy physics is the same as in our vacuum. We ask only about 
the probability distribution of pA in this subspace of the landscape. This is a valid 
consistency check: Suppose that the observed value were disfavored on a subspace 
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picked out by other observed parameters. Then it would only become less likely when 
the distribution is extended over the entire landscape, and so the model would conflict 
with observation. 



Because negative values of p\ are tightly constrained by standard (though ques- 
tionable) anthropic arguments, the main challenge for environmental approaches has 
been to suppress large positive values of p\. For this purpose, it suffices to concentrate 
on the subset of vacua with pA > 0. This simplifies our analysis, since negative pa 
lead to a different class of metrics. (Interestingly, a preliminary analysis indicates that 
negative values will be somewhat favored by the Causal Entropic Principle, though not 
by enough to render the observed value unlikely.) 



It will be interesting to use the Causal Entropic Principle to compute the prob- 
ability distribution of other parameters, such as the amplitude of primordial density 
perturbations, 5p/p, the spatial curvature, /c, or the baryon to photon ratio, rj. A 
crucial task will be to estimate the probability distribution over multiple parameters 
at once, since this is a much more stringent test for the environmental approach. For 
example, consider a distribution over two parameters, pA and 5p/ p. When weighting by 
observers-per-baryon, the upper bound on pA arises from the requirement that galaxies 
can form. Hence, it would seem highly favorable to increase (5p/p, since pa could then 
be increased by the third power of the same factor. This would render the observed 
values of both pA and 5p/p exceedingly unlikely. As we will discuss in Sec. how- 
ever, galaxy formation is not a significant constraint on pA in our approach. Hence, we 
expect this problem to be virtually absent. 



Given a multivariate distribution, one can ask about the probability distribution 
over one parameter (say, logpA) with other parameters integrated out. As more pa- 
rameters are allowed to vary, the distribution for log pA is likely to become broader 
after they are integrated out. Yet, the observed value must remain typical if the envi- 
ronmental approach is to succeed. The most radical choice is to study the distribution 
of log Pa after integrating out all other parameters characterizing the landscape. This 
would be tantamount to deriving the typical range of log pA from fundamental theory 
alone, without reference to parameters specific to our own vacuum. This would have 
been impossible in conventional approaches. But as we discuss in Sec. |4.1| , AS may 
depend simply on pA when averaged over the entire landscape. Hence, the Causal 
Entropic Principle puts this task within our reach. 
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2. Approaches to weighting vacua 



2.1 The Weinberg bound 

Weinberg [20] estimated the range of p\ compatible with galaxy formation. No galax- 
ies form in regions where pa exceeds the matter density pm at the time when the 
corresponding density perturbations become nonlinear (assuming otherwise identical 
physics). If we grant that galaxies are a prerequisite for the existence of observers, 
then these regions will not contain observers, and such values of pA will not be ob- 
served. Combined with a similar argument^ for negative values of pA, Weinberg [20] 
found that only values in the interval 

-1< ^ < 550 (2.1) 

Pm 

will be observed. 

The upper bound is larger than 1 because the matter density today, pm, has been 
diluted since galaxy formation, by the redshift factor (1 -|- %ai)^- Weinberg used Zgai ~ 
4.5, but in the meantime, dwarf galaxies have been discovered at redshifts as high as 
Zgai ~ 10 [26], raising the upper bound: 

-1 < — < 5000 . (2.2) 

Pm 

The observed ratio of vacuum to matter energy density is much smaller than the 
upper bound: 

— ^ 2.3 . (2.3) 

Pm 

In other words, it would appear that the observed value of pA is in fact quite unlikely, 
even allowing for the anthropic constraint, Eq. ( p.2|) . 

Let us rephrase Weinberg's argument in a more general language. The probability 
distribution for observed values of pA can be written as 

dP , dp dN 

d^.^'^'^P'^dNd^,^ ^'-'^ 

where is the number of vacua with vacuum energy smaller than pA, and p is the 
total prior probability for these vacua. Since pa = is not a special point, vacua in the 



■^With PA < 0, the universe will recoUapse after a time of order \p\\^^/^. If one assumes that most 
observers emerge only after several billion years, then an upper bound on (—pa) results by requiring 
that the universe should not recoUapse too soon [18]. — We will consider only positive values of A in 
this paper. 
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landscape are uniformly distributed when averaged over intervals of p\ of order 10 
or smaller near pA = 0: 

- — = const . (2.5) 
dpK 

Before anthropic selection, it is reasonable to assume that all vacua are equally likely:^ 

— ^ = const . (2.6) 

The anthropic condition of galaxy formation assigns a weight w = 1 to vacua in the 
range of Eq. ( |2.2| ), and w = to all other vacua. Thus, all values of pa in the interval 
of Eq. ( pr^ ) are equally likely, dP/dpx = const. 

Restricting to vacua with pA > 0, it is instructive to consider the probability 
distribution as a function of logpA, 

dP dP . . 

-7] = Pa-1— oc Paw(pa) . (2.7) 

d log Pa apA 

With the above, "binary" weight, this distribution will be a growing exponential of 
log Pa, with a sharp cutoff at logpA ~ —120 from the upper bound in Eq. (|2.2|) . The 
observed value, logpA ~ —123, is suppressed by about 3 orders of magnitude compared 
to values near the cutoff. 

2.2 Weighting by observers per baryon 

In order to reduce this discrepancy, one would need to go beyond the binary question 
of whether or not there are observers. Surely the number of observers will depend on 
the cosmological constant and will begin to decrease for values of pA smaller than the 



upper bound in Eq. (|2.2|) . If we weight vacua by this number, perhaps this will be more 
effective at suppressing large values of pa than a simple binary filter. 

Unfortunately, this strategy is not well-defined without a regularization scheme. 
The dynamics of eternal inflation results in a multiverse containing an infinite number 
of regions for every value of pA- Each region is an open universe with infinite spatial 
volume at all times. (The hyperbolic spatial geometry reflects the symmetries preserved 
by a vacuum bubble formed in a first-order phase transition from a higher metastable 
vacuum.) If a vacuum admits any observers at all, their number will be infinite. 

Various authors [22-24] have proposed to weight vacua by the number of observers 
per baryon, or per photon, or per unit matter mass. But none of these choices are 



''Dynamical effects can modify this flat prior [27]. We shall assume that the resulting distribution 
remains effectively flat, at least for small |pa|. Models violating this assumption are unlikely to solve 
the cosmological constant problem. 
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particularly well motivated. If there are infinitely many baryons, why should it matter 
how efficiently they are converted to observers? Why is a vacuum less likely to be ob- 
served if a smaller fraction of its mass becomes observers, as long as there are infinitely 
many of them? 

More importantly, these regularization methods are not universally defined. This 
makes them inapplicable in a rich landscape, where we will eventually be forced to 
consider vacua with very different low energy physics. Two vacua may have different 
baryon-to-photon ratios, so that the above weighting methods are inequivalent; which 
should we choose? Indeed, it seems unlikely that a standard definition of "baryon" can 
be given that would be meaningful in all vacua. ^ 

These difficulties arise because the regularization refers to a reference particle 
species such as "baryons". But it also refers to "observers", and this leads to ad- 
ditional problems. It seems virtually impossible to define what an observer is in vacua 
with different low energy physics. Even in our own universe, it is unclear how to esti- 
mate the number of observers per baryon. One approximation is to assume that it will 
be proportional to the fraction of baryons that end up in galaxies. But this fraction 
depends strongly on the minimum mass of a galaxy capable of harboring observers, 
M^, which is not known. 

Figure 1^ shows probability distributions for pA, under various assumptions for M*. 
Dwarf galaxies as small as M^, ~ IO^Mq have been detected. With this choice, the 
observed value of A is nearly three orders of magnitude, or 3.5a, below the median 
value. 

There is some evidence that galaxies with mass below IO^Mq will not retain the 
heavy elements produced in supernova explosions [25]. Under the additional assumption 
that such elements are required for life, one may then set M^, to this larger value. But 
the resulting prediction remains unsatisfactory. The median exceeds the observed value 
by a factor of more than two orders of magnitude, or 2.9(J. 

One can speculate that for some reason, life requires a galaxy as large as our own, 
or perhaps even a larger group [22,25] {M^ = lO^^M©). Then the observed value is 
about 1.8(7, or a factor of 22, below the median of the predicted distribution. However, 
at present we can see no compelling arguments for this extreme choice. Thus, the 
weighting by observers-per-baryon leads to a dilemma: Either it requires overly specific 
and questionable assumptions, or else it faces a real conflict with the data. 

As these difficulties demonstrate, weighting by observers-per-baryon may not be 
the correct way to compute probabilities in the landscape. We will now argue for a 

^Ref. [28] proposes an interesting method for defining a unit comoving volume in different vacua, 
in the hmit where bubbles preserve an exact 5*0(3, f ) symmetry. 
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Figure 2: Weighted by "observers per baryon", the probabihty distribution for pA de- 
pends strongly on specific assumptions about conditions necessary for hfe. Three curves 
are shown, corresponding to different choices for the minimum required mass of a galaxy: 
= (10^, 10^, 1O^^)M0. In neither case is the observed value (vertical bar) in the preferred 
range. The choice = IO^Mq (also shown in Fig. 1) corresponds to the smallest observed 
galaxies. The choice = lO^^M© minimizes the discrepancy with observation but amounts 
to assuming that only the largest galaxies can host observers. By contrast, the Causal En- 
tropic Principle does not assume that observers require structure formation, let alone galaxies 
of a certain mass; yet its prediction is in excellent agreement with the observed value (see 
Fig. 8). 

different approach, which is always well-defined. It will allow us to assume nothing 
more about observers than that they respect the laws of thermodynamics. 

2.3 Weighting by entropy production in the causal diamond 

Causal Entropic Principle In this paper we will compute the probabihty distribu- 
tion for Pa based on the Causal Entropic Principle, which is defined by the following 
two conjectures [21]: 

(1) The universe consists of one causally connected region, or "causal diamond". 
Larger regions cannot be probed and should not be considered part of the semi- 
classical geometry. 
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conformal time 

A. 



reheating surface 

Figure 3: A causal diamond is the largest spacetime region over which matter can interact. 
It is delimited by a future lightcone from a point on the reheating surface (orange/light), and 
by a past lightcone from a late-time event (blue/dark); in the case of de Sitter vacua this 
is the cosmological horizon. A vacuum should be weighted by the number of observations 
made in this spacetime region. Since observation requires free energy, we expect that on 
average, this number will be proportional to the amount of entropy, AS, produced in the 
causal diamond. Entropy entering through the bottom cone (bottom arrow), such as the 
CMB, does not contribute to this entropy difference. 

(2) The number of observations is proportional to AS, the total entropy production 
in the causal diamond. 

Before motivating the two conjectures, let us first clarify the key terms — causal 
diamond and entropy — they refer to. 

The causal diamond [29] is the largest region of spacetime causally accessible to a 
single observer. It is defined by the intersection of the past light-cone of a late-time 
point on the worldline with a future light-cone of an early-time point, shown in Fig. |^. 
We choose this time to be the time of reheating, since no matter existed before then. 
(In vacua with no reheating, AS vanishes independently of the choice of the causal 
diamond. This case does not arise here, since we are holding all parameters other 
than Pa fixed.) Only after reheating can matter begin to interact and commence the 
formation of complex structures, at most at the speed of light. 

In a vacuum with negative cosmological constant, the tip of the top cone would be 
on the big crunch. In any metastable vacuum with positive cosmological constant, like 
ours, the top cone is given by the de Sitter event horizon. 

One may question the universality of "reheating surface" or our use of an event 
horizon (a global concept) in a vacuum with finite lifetime, so let us give a more careful 
definition. A causal diamond can be triggered (that is, a bottom cone drawn) as soon 
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as there is entropy in the matter sector. Reheating is a special case: during inflation, all 
entropy is in the gravitational sector (the growing horizon of the inflationary universe) , 
but reheating generates matter entropy (mostly radiation, which we include in this 
class). After vacuum domination in a metastable de Sitter region, the diamond empties 
out at an exponential rate. If we enlarge the diamond by choosing a later point for the 
tip, and the additional spacetime volume contains no matter (which will be generic at 
late times), there is no point in going further. The total amount of matter enclosed by 
the top cone will be the same as if the vacuum had been completely stable. Once the 
metastable vacuum decays, reheating may again occur, in which case a new diamond 
is triggered. This definition implies that if a decay or phase transition happens while 
there is still matter around, the two vacua should not be considered separate, and a 
single causal diamond will enclose both. Thus, we are fundamentally defining the range 
of a causal diamond in terms of the presence of excitations in the matter sector. This is 
well-defined in the entire regime of semi-classical gravity, independently of the details 
of the particle physics. 

In a de Sitter vacuum, the cosmological horizon has entropy given by one quarter 
of its surface area. The relevant area is the cross-section of the top-cone of the causal 
diamond, which grows as matter crosses the horizon. This production of Bekenstein- 
Hawking entropy would contribute enormously to AS. However, it is difficult to see 
the relevance of horizon entropy to the existence of observers. For the same reason, we 
will ignore the Bekenstein-Hawking entropy produced in black hole formation. This is 
also natural since we have defined the causal diamond through this distinction. As we 
have emphasized above, horizons arc a gravitational phenomenon and can always be 
distinguished from the matter sector in the semiclassical regime. Hence, this restriction 
does not affect the universality of our method.^ 

To summarize, we will consider exclusively the production of entropy in ordinary 
matter. We will weight a vacuum by the total entropy increase in the causal diamond: 

w^AS. (2.8) 

Motivation From a pragmatic point of view, one can simply regard this proposal as 
an attractive alternative to weighting by observers-per-baryon. The causal diamond is 
defined independently of low-energy physics, and the entropy increase is a well-defined 

®A different question is whether the exclusion of black hole horizon entropy actually makes a 
qualitative difference to the results of this paper. It seems likely that it would not. According to the 
Causal Entropic Principle, the preferred value of pA is set by the time of maximum entropy production. 
The growth of supermassive black holes, and of their entropy, is largest during the era a few Gyr after 
the big bang and eventually slows down. Thus, it appears to set a similar timescale to the one we 
obtain from stellar entropy production. 
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quantity that replaces more specific assumptions about observers. However, there are 
additional, more fundamental reasons to embrace the Causal Entropic Principle. Let 
us discuss each of the two conjectures put forward at the beginning of this subsection. 

The first conjecture is motivated by the study of black holes (see [21,29,30] for 
details) . There is now considerable evidence that black hole formation and evaporation 
is a unitary process [31,32]. This means that there will be two copies of the initial state 
at the same instant of time, one inside the black hole, and the other in the Hawking 
radiation outside. This would appear to violate the linearity of quantum mechanics. 
However, no actual observer can verify the problem [33,34], since the two copies reside 
in causally disconnected regions of the spacetime. Hence, we can escape from the 
apparent paradox by abandoning the global viewpoint, and be content with merely 
predicting the observations of (any) one observer. 

However, it would be unnatural for such a radical revision of our view of spacetime 
to be confined to the context of black holes. In many cosmological solutions, a single 
observer can access only a small portion of the global spacetime. Hence, it is equally 
important to restrict our attention to only one (cosmological) observer at a time, which 
is what we do in this paper. Descriptions of cosmology from the local viewpoint can 
be found, for example, in Refs. [29,35-38]. Its implications for eternal inflation were 
studied in Ref. [39]. 

The second conjecture replaces far more specific conditions assumed to be neces- 
sary for observers, such as the existence of galaxies, stable planetary orbits, suitable 
chemistry, etc., which were needed in the observers-per-baryon approach. The basic 
idea is that observers, whatever their form, have to obey the laws of thermodynamics. 
Observation requires free energy and is clearly incompatible with thermal equilibrium 
or an empty universe. The free energy, divided by the temperature at which it is radi- 
ated, can be regarded as a measure of the potential complexity arising in a spacetime 
region. This is equal to the number of quanta produced, or more fundamentally, the 
entropy increase AS. 

It seems plausible that there might be an absolute minimum complexity necessary 
for observers, so that subcritical weights A^" < A5'crit should be replaced by zero. For 
example, it seems likely that vacua with pA of order unity, which can contain only a few 
bits, have strictly zero probability of hosting observers (see also Ref. [40]). However, 
any such cutoff does not appear to play an important role for the questions studied 
here. We choose the weight factor to be simply AS, with no cutoff. 

To avoid confusion, we stress that our weighting has nothing to do with the Hartle- 
Hawking amplitude, exp(S'ds), which describes the number of quantum states associated 
with a de Sitter horizon [41]. The number of observers, and of observations, is naturally 
proportional to an entropy difference, AS, not to an absolute entropy or the exponential 
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of an entropy. (Despite our appropriation of the term "entropic principle" , for which 
we apologize to the authors, there is also little relation to Ref. [42].) 

We could have adopted only the first conjecture, and continued to estimate the 
number of observers by more explicit anthropic criteria. This would not have changed 
our final result significantly. But why make a strong assumption if a more conservative 
one suffices? Our results suggest that the poor predictions from weighting by observcrs- 
per-baryon stemmed not from a lack of specificity in characterizing observers, but 
from the regularization scheme ("per baryon"). The causal diamond cutoff solves this 
problem, and allows us to weaken anthropic assumptions to the level of an inevitable 
thermodynamic condition. 

Moreover, AS is a well-defined weight in any vacuum, however different from ours. 
It will be a powerful tool in future work, when parameters other than pA are allowed to 
vary simultaneously. We are thus motivated to use our second conjecture throughout. 
We will find that in our own universe, it captures conditions for observers remarkably 
well. 

3. Computing p\ from the Causal Entropic Principle 

In this section, we will compute the probability distribution over p\, holding all other 
physical parameters fixed. We do this in four steps. First, we will compute the geometry 
of the causal diamond as a function of p\. Next, we will identify important effects 
that produce entropy within the causal diamond. Then we will determine the time- 
dependence of the entropy production rate per unit comoving volume, as a function 
of Pa- Finally, we will fold this together with the time-dependence of the comoving 
volume contained in the causal diamond to obtain the weight factor, AS{p\) . 

3.1 Metric and causal diamond 

Current data are consistent with a spatially fiat universe. Hence, we will assume that 
since the time of reheating, the large scale structure of our universe is described by a 
spatially flat FRW model: 

ds'^ ^ -dt^ + a{t)^d:x.^ . (3.1) 

(Actually, wc arc making the stronger assumption that the cosmological constant domi- 
nates before curvature does, for all values of pa considered here. Thus, we are assuming 
that the universe is flatter than necessarily required by current constraints.) 

After reheating, the universe will be dominated first by radiation, then by matter, 
and finally by vacuum energy. The reheating temperature will not be important; in 
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fact, we will neglect the radiation era altogether. Instead, we extrapolate the matter- 
dominated era all the way back to the big bang (t = 0), where we will place the bottom 
tip of the causal diamond. This approximation is justified because the radiation era 
contributes only a small fraction of conformal time in the range of values of p\ that 
have any significant probability. This is shown in detail in Appendix 0. 

Thus, we treat the universe as containing only pressureless matter and a cosmolog- 
ical constant. At early times (matter domination), the scale factor will be proportional 
to t^^^ , independently of p\. At late times (vacuum domination) it will grow like 
exp(t/tA), where 

U=a/|- = a/^, (3.2) 



V A V Sttpa 

(In our universe, with pA given by Eq. (|1.1|) , we have ^a ~ 0.98 x 10^^ = 16.7 Gyr). 
An exact solution (aside from the neglected radiation era) that includes both 



regimes is 



a{t) 



t\ sinh 



p{t) = PA + Pr, 



3 t 
2U 



PA 



2/3 



1 + 



sinh 



3_t 

2tA ; J 



(3.3) 



(3.4) 



The prefactor, t^/^, is arbitrary and can be changed by rescaling the spatial coordinates. 
Our choice ensures that for solutions with different values of A, the scale factors will 
agree at early times not only by diffeomorphism, but explicitly. This is convenient 
because A is dynamically irrelevant at early times. 

Vacuum energy begins to dominate the density when pA = Pm, at t = 0.59 tA (in 
our universe, 9.8 Gyr after the big bang). Acceleration begins earlier, when p + 3p = 
Pm — 2pA = 0, at 0.44 tA (in our universe, at 7.3 Gyr). 

In order to compute the boundaries of the causal diamond, it is convenient to work 
in conformal time, t = J dt/a{t). The metric becomes 



ds" = a{Tf[-dT'' + d^^] . 

Light-rays obey ds = 0, and hence for radial light-rays, dr = ±dr, where r 
Using our solution, Eq. (p.3|), conformal time will be given by 



T{t) 



(3.5) 




(3.6) 
(3.7) 
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It has finite range, running from 

r(0) = -^^ll^ptf ^ -2.804 (3i 

at tlie big bang, to r(oo) = at asymptotically late times. The total conformal lifetime 
of the universe is Ar = r(oo) — r(0) = 2.804 1 



1/3 
A 



The causal diamond is given by the region delimited by 

(3.9) 



Ar 



At 



The comoving volume inside the causal diamond at conformal time r is simply 

K = -jr\r) . (3.10) 

This is shown in Fig. ^, as a function of proper time, for several values of pA- The 
maximum comoving volume occurs at the edge of the causal diamond at conformal 
time r(0)/2, or equivalently, 

tedge ~ 0.23tA (3.11) 

(in our universe, tgdge ~ 3.9 Gyr). The maximum comoving volume itself is V^(r(0)/2) 

ii.etA. 

At late times, the comoving volume goes to zero. This reflects the exponential 
dilution of all matter, which is expelled from the diamond by the accelerated expansion. 
The physical radius approaches a constant, ^a, the horizon radius of the asymptotic 
de Sitter space. Note that the "comoving four-volume" , 

Vi= I Vcit)dt , (3.12) 
Jo 

is finite. It is proportional to t\, and hence, inversely proportional to the cosmolog- 
ical constant. This will be important, since it means that smaller values of pA are 
rewarded by a larger causal diamond, and thus, potentially greater complexity. This 
can compensate for their rarity. 

3.2 Major sources of entropy production 

To calculate dP/d{\ogp^), we need to calculate the total entropy production in the 
causal diamond as a function of the cosmological constant, AS'(pa). We have deter- 
mined above how the causal diamond depends on pA, but we must also understand 
how the rate of entropy production depends on pA. We begin by identifying the major 
sources contributing to AS* in the causal diamond in our own vacuum. 
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Figure 4: The lower plot shows comoving volume in the causal diamond, 14, as a function of 
time for = (0-1, 1, 10) times the observed value given in Eq. (LI). The kink in the comoving 
volume corresponds to the "edge" of the causal diamond, where the top and bottom cone 
meet (see Fig. 3). The upper plot shows the rate of entropy production computed in Sec. |3.3| , 
which peaks around 2 to 3.5 Gyr. As explained in Sec. the Causal Entropic Principle 
prefers values of p\ such that the (pA-dependent) peak of the comoving volume coincides with 
the (nearly /)A-independent) peak of the entropy production rate (see Fig. 9. 



First, let us discuss sources which can be neglected. Because the causal diamond 
is small at early times <^ 1), and empty at late times (1 — <^ 1), the most 
important contributions will be produced in the era between 0.1 Gyr to 100 Gyr, when 
the comoving volume shown in Fig. ^ is large. Hence, we can disregard the entropy 
produced at reheating, at phase transitions, or by any other processes in the early 
universe. Virtually all of this entropy (in particular, the cosmic microwave background) 
entered the causal diamond through the bottom cone and does not contribute to AS". 
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For the same reason, we neglect the entropy in Hawking radiation produced by 
supermassive black holes. (One might contemplate the possibihty of a "planet" orbiting 
such an object and exploiting its very low temperature radiation as a source of free 
energy.) However, the timescale for the evaporation of a black hole is enormous (M^). 
By the time this entropy would be produced, a typical causal diamond, on which the 
measure for prior probabilities is based in the local approach [21], will be completely 
empty. 

Having dismissed effects at small comoving volume, we turn to processes which 
operate from 0.1 Gyr to 100 Gyr, when the comoving volume is large. In this era, 
entropy is produced by baryonic systems, and we can gauge the importance of a given 
process by its total entropy production per baryon, or equivalently, per unit mass, or 
unit comoving volume,^ dS/ dVc- This can be estimated as the ratio between the amount 
of energy released per baryon, and the temperature at which this energy is dominantly 
radiated. 

Stcirs Ten percent of baryons end up in galaxies, and thus in stars. Approximately 
ten percent of those baryons actually burn in the course of the lifetime of the star. 
The energy released is about 7 MeV for each baryon in the reaction 4p — > ^He, or 
about 0.7 X 10~^ of the rest mass of the proton. Stars radiate at varying temperatures 
corresponding largely to optical wavelengths (about 0.2 to 3 eV). However, more than 
half of this radiation is reprocessed by dust [43].^ It is re-emitted in the infrared, at 
approximately 60 ;um, or 20 meV. Hence, there will be more than 100 infrared photons 
per optical photon [44], and the infrared emission will dominate the entropy produc- 
tion. In summary, stellar burning produces an entropy of order 10^ per baryon, after 
reprocessing by dust. 

Active galactic nuclei Active galactic nuclei (AGNs) appear to be the main com- 
petitor to stars in terms of luminosity and entropy production.^ Approximately 10^'^ of 
the total stellar mass (i.e., 10"'' of baryonic mass) ends up in supermassive black holes, 
and perhaps 10% of this energy is released during the violent accretion process. This 
suggests that AGNs release at most about one sixth of the energy as compared with 
stellar burning. The effective temperature will again be 20 meV, since a large fraction 
of the radiation is reprocessed by dust. Hence, AGNs appear to produce somewhat 
less total entropy per baryon than stars, and we will neglect their contribution in the 
present paper. 

''The choice of reference unit does not affect our weighting, because it drops out after integrating 
over the causal diamond. 

®We thank Ehot Quataert for pointing this out to us. 

^We thank Petr Horava for drawing our attention to AGNs. 
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In Ref. [45], the intrinsic luminosities of all AGNs above observational limits were 
compiled to create a quasar luminosity function applicable back to about 1 Gyr after 
the big bang. This luminosity function suggests that the entropy production rate from 
quasars is more narrowly peaked than that estimated for stars in Fig. ^. But even at 
the peak, near a redshift of z = 2, the quasar luminosity is a factor of 3 lower than the 
present stellar luminosity [46] (which, in turn, is about one order of magnitude lower 
than the peak stellar luminosity). This seems to rule out the possibility that AGNs 
ever dominated the entropy production rate. 

Other potentially important contributions come from galaxy cooling and from su- 
pernovae. Even assuming reprocessing by dust, neither of these phenomena can com- 
pete with stellar burning, because they run on less energy but not at lower temperature. 



Supernovae We focus on core collapse supernovae since they are more abundant 
than type la supernovae (by factor of 4-5) while producing comparable luminosity per 
event. They occur in all stars with more than eight solar masses, which constitute 1% 
of the total stellar mass [see Eq. ( p.l7|) ]. The collapse of an iron core into a neutron star 
releases gravitational binding energy not much smaller than the core mass (1.4 solar 
masses). Thus roughly 10% of the total mass of the progenitor is released. Most of 
this energy is carried away by high energy neutrinos, producing little entropy. Only 1% 
produces optical photons, and is reprocessed by dust into infrared photons. Altogether, 
supernovae thus convert a fraction of 10~^ of stellar baryonic mass into soft photons. 

Further quantitative analysis confirms the above estimate. We find that entropy 
production from supernovae is more than order of magnitude below the contribution 
from dust heated by stars. 



Galaxy cooling A typical galaxy like ours has a mass-to-radius fraction of approx- 
imately 10^^. This fraction of the galaxy mass is converted into kinetic energy at 
virialization. This energy, about 1 keV per stellar baryon, is converted into radiation 
as the galaxy cools. The virial temperature is about 10^ K, or 10 to 100 times greater 
than the temperature of a star. Even assuming reprocessing by dust, galaxy cooling will 
produce less than 10^ photons per baryon. This is more than two orders of magnitude 
below the entropy production from dust heated by stars. 



3.3 Entropy production rate 

We have argued that the dominant source of entropy production in our universe is dust 
heated by starlight. In this subsection we will consider the rate at which this entropy 
is produced. We will ask how it depends on time and on p\. 
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Time dependence Deriving dS/ dVc dt from first principles would require a detailed 
description of all of the dynamics that led up to stellar burning, such as non-linear 
evolution, gas cooling and disk fragmentation. Instead, we will take advantage of ob- 
servations that quantify the time-dependence of the star formation rate. This will allow 
us to obtain the entropy production from dust heated by stars. We will then estimate 
how this rate changes in universes with different cosmological constants. Surprisingly, 
this latter dependence will not be important for our final result. 

In recent years, observations of the extragalactic background radiation in a large 
range of wavelengths have improved our understanding of the galaxy luminosity func- 
tion. This has allowed astronomers to infer the star formation rate (SFR) in galaxies, 
and its evolution in time [47,48] (see [49] for a review). The SFR is defined as the rate 
of stellar mass production per comoving volume 

(3.13) 

This function is constrained by observation through a variety of techniques. For ex- 
ample, UV emission from star-forming galaxies is dominated by massive stars that are 
short-lived. Due to the short lifetime, luminosities in these wavelengths track the birth 
rate of stars [50] . In addition, detailed surveys of the local universe constrain the SFR at 
low redshift [51,52]. Bounds on the rate of type II supernovae from Super-Kamiokande 
and KamLAND also place an indirect bound on stellar birth [53]. 

The combination of these measurements constrain the SFR back to redshifts of 
about z 6, when the universe was 1 Gyr old. Since then, the SFR may be grossly 
described as a smooth function that peaks at around 2.5 Gyr and subsequently decreases 
exponentially with time. The SFR today is roughly one order of magnitude less than 
its peak value. As we shall see, the era of peak stellar formation, which we will call i*, 
will play an essential role in determining the cosmological constant using the Causal 
Entropic Principle. 

Several authors have postulated models or functional forms for the SFR, fitting 
model parameters to the data, for example [54-57]. Variations between the fits lead to 
a range for ~ 1.5 — 3 Gyr. 

We will illustrate our computation using two different SFRs, in order to illustrate 
the systematic dependence of our calculation on the time dependence of star formation. 
The first SFR is from Ref. [56] (labeled "N" in plots) and has ~ 1.7 Gyr. The second 
SFR, from Ref. [57] (labeled "H"), peaks at U ~ 2.8 Gyr. We will find in both cases 
that the observed value of pA lies well inside the la region of the resulting probability 
distribution. 
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Figure 5: The star formation rate as a function of time from Nagamine et al. [56], labeled 
"N", and from Hopkins and Beacom [57], labeled "H". They are peaked at ~1.7 Gyr and 
2.8 Gyr respectively. We have normalized both SFRs such that the stellar luminosity density 
calculated below agrees with the bolometric luminosity observed today. It should be noted, 
however, that our result is not sensitive to this re-normalization. 

Both SFRs are shown in Fig. |^ re-normahzed. One of the biggest uncertainties 
regarding the determination of the star formation rate is the overall normalization of 
the curve. Fortunately our result is completely insensitive to this overall normalization; 
it depends only on the shape of the SFR. 

For concreteness, we have normalized the SFR such that the implied stellar lu- 
minosity today matches the observed bolometric luminosity from low-redshift stars, 
10^'^Lq/Mpc^. Our normalizations differs from those in the original works since we 
have used rather rudimentary formulae to compute the present luminosity from the 
SFR. However, the shape of the total entropy production rate derived with these sim- 
ple formulae agrees well with that in Ref. [56] when this SFR is used. 

With the rate of star formation in hand, one can estimate the rate of entropy 
production by stars. Let us first consider a single star of mass M born at a time t' . Its 
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entropy production rate is the stellar luminosity divided by the temperature at which 
photons are radiated, 

Here we have assumed a mass-luminosity relation 

oc M^-s . (3.15) 

(We use to refer to total stellar mass, and M to refer to the mass of a specific star. 
iV^ denotes star number.) The effective temperature of 20 meV is that of the dust 
which reprocesses about one half (hence the prefactor) of the starlight and dominates 
photon number. 

The star will only produce entropy over a finite time that also depends on M: 

i^iM)-[-w) 10'° y^- (3-16) 

Now let us consider a whole population of stars that are formed at a time t' . At 
birth, stellar masses are observed to be distributed according to a universal, time- 
independent function known as the initial mass function (IMF). The Salpetcr distribu- 
tion [58], which goes as M~^-^^, agrees reasonably with observation, but has since been 
refined by many authors. In the present calculation we will use a modified Salpeter 
IMF of the form [57] 



eiMF(M) = ^ - I ^^^_i.5 ^ ^ Q , (3.17) 

where the constants Ci,2 are set by requiring that the IMF is continuous and that it 
integrate to one over the range O.OSM© < M < IOOM0. 

We now have all the ingredients in place to calculate dPS/ (dM^^dt), the contribution 
of a stellar population that is born at time t' to the entropy production rate at some 
later time t > t', per unit initial stellar mass. This rate is a function only of the time 
difference t — t' 

d^S , 1 z-^— (*-*') 



1 ril'Jmaxl.t-rj J2 



dM^dt 

with the average initial mass (defined at f), (M), defined as 

"lOOM© 



(M) = / dM^iMpM ^OASMq . (3.19) 

Jo.OSMq 
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Figure 6: The entropy production rate in our universe as a function of time, from dust 
heated by starhght. The two curves shown correspond to different models of the star formation 
history of the universe [56,57]; see Fig. 5. 



The time dependence enters Eq. (|3.18|) through the death of stars of various masses at 
different times and thus appears in the upper hmit of the upper integral. It is derived 
by inverting equation ( |3.16| ): 

for t-t' < 10^ yr 
Mq for t-t' > 10^ yr 




The lower limit of the integral is set by the minimal mass of stars that can sustain 
nuclear burning, 

Minin = O.O8M0 . (3.21) 

These stars burn for ~ 5000 Gyr, living well into vacuum domination in our universe. 

In order to calculate the entropy production rate in the universe at time t we must 
convolve (fS/{dM^dt) with the SFR. That is, we integrate over the birth times t' of all 
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populations of stars born before the time t: 

dV^v' Jo dM^dV 'dVcdt'^ ^ 

This function is plotted in Fig. ^ for the two forms of the SFR. It is a smooth 
function that peaks when the universe is about 2.3 Gyr old and 3.3 Gyr old for the 
two curves plotted. This timescale is set by the peak of the star formation rate and 
the mean lifetime of a star in our universe. The entropy production rate decreases as 
stars born during the peak of the SFR begin to die. But due to the high abundance of 
long-lived low mass stars, dy^S maintains a finite value long after star formation has 
ceased. 

In our approximation, the entropy production rate is half the luminosity of stars 
at the time t, divided by the effective temperature (dust at 20 meV). Modulo this 
rescaling. Fig. ^ thus also shows our estimate for the luminosity, which was used for 
normalization as described above. 

We caution that we assumed overly simplistic formulae for the luminosity and 
the lifetime of a star. For example, we ignored the dependence on metallicity, and 
a dependence of the exponent in Eq. ( |3.15| ) on the mass. This will likely affect the 



shape of the entropy production rate somewhat, as will corrections from other sources 
of entropy. However, our prediction for pA depends only on the roughest features: the 
width and position of the peak of the entropy production rate. Hence, it is unlikely 
that our result would be qualitatively affected by our simplifications. 

Dependence on pA The entropy production rate calculated above is that in our 
universe. In order to determine A5'(pa); we will need to calculate this function for uni- 
verses with different values of the cosmological constant. Interestingly, this dependence 
will be of little importance for our final result. 

Stars have decoupled from the global expansion of the universe, so their internal 
dynamics is unaffected by variations of pa- However, the value of pa affects the fraction 
of baryons that are incorporated in halos large enough to form stars at any given time, 
thus affecting the star formation rate, and ultimately the entropy production. For 
example, if pA is large enough to violate Weinberg's anthropic bound, no baryons will 
be in star forming halos and AS* will be very suppressed. 

In a universe with a vacuum energy density pa, the fraction of matter that is incor- 
porated in halos of a mass Mq by time t or above, F{p\, Mq, t), is easily calculated in 
the Press-Schechter (PS) formalism [59]. The formulae for the PS fraction are summa- 
rized in [4]. (This fraction is also a function of the amplitude of density perturbations 



- 24 - 



and the temperature at matter-radiation equality, but since these are held fixed in this 
calculation we will suppress them.) 

For the purpose of our calculation, we will consider a galaxy to be star-producing 
if the mass of its host halo is lO^M© or above. Note that this choice involves no 
speculation about what observers need. The Causal Entropic Principle requires us to 
compute the entropy production rate as a function of p\. This rate depends on whether 
or not stars, the dominant contributors of entropy, actually form. For Mq > lO^M©, 
the virial temperature of the halo is above 10^ K, enough to support rapid line cooling 
and efficient stellar production. (The first generation of stars — Population III — were 
formed in galaxies with even lower masses, but since these stars have been neither 
observed nor accounted for in the observation-based SFRs, we will not consider them 
further.) In any case, as we shall see, our final result will be quite insensitive to this 
choice. 

Based on the SFR Pv,(t) in our universe, we will now estimate the SFR in a universe 
with a different vacuum energy, p^(pA,i^)- The SFR in our universe is peaked at about 
~ 1.5 — 3 Gyr, which is still in the matter dominated era. The cosmological constant 
played no dynamical role and cannot have anything to do with this peak. Rather, this 
time scale is set by astrophysical dynamics, such as galaxy formation, cooling, and 
feedback. 

Therefore, the star formation rate depends mainly on the mass fraction in star 
forming galaxies at the critical time t^. Strictly speaking, the SFR will be sensitive 
to the PS fraction at times before because of the cooling period that is required 
between the time a baryonic structure goes non-linear and the time it collapses into 
stars. Baryons that burn during the peak of the SFR actually fell into non-linear halos 
a cooling time earlier. We leave a more careful analysis of these and other effects to 
further investigation. 

Let us define F^{p\) as the PS fraction that is most relevant for star formation in a 
universe with cosmological constant p\, namely that evaluated at t^, with a minimum 
mass of IO^Mq. In order to capture the mild sensitivity of the SFR to changes of the 
cosmological constant, we rescale the SFR by the appropriate F^: 

Mp».0 = P.Wx^r(^^^^^. (3.23) 

Because the observed value of pa is far from disturbing the formation of IO^Mq 
galaxies, small variations of pa will barely affect the mass fraction F^; see Fig. |^. But 
they do affect the geometry of the causal diamond. This is the reason why the latter 
effect will be important for our final result, while Eq. ( p.23|) gives only a tiny correction. 
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Figure 7: The entropy production rate of Fig. |^ depends only mildly on the vacuum energy. 
Hence, dependence on the vacuum energy enters our calculation mainly through the size of the 
causal diamond (see Figures 4 and 9). The rate of Ref. [56] is shown here for p\ = (0.1, 1, 10) 
times the observed value (from top to bottom) using the approximation of equation |3.23| . 

3.4 Total entropy production in the causal diamond 

From the above results, we can compute the total entropy production 

poo 

ASip^) = / dtV,{pj,,t)dvJ{pA,t) . (3.24) 
Jo 



Here, Vc is the comoving volume in the causal diamond at the time t, given in Eq. ( p.lO|) . 



dv^S is the rate of entropy production per unit comoving volume, given in Eq. ( |3.22|) 



The dependence on px enters mainly through Vc. It is straightforward to perform the 
integrals numerically. 



By Eqs. ( p.7|) and (|2.8| ), the probability distribution dP/d{\og pa) is proportional 
to p\AS{pa). We show this distribution in Fig. ^ for both the Nagamine et al. [56] as 
well as the Hopkins et al. [57] SFRs. For the Nagamine et al. SFR, the median value 
is Pa = 5.6 x 10^^^^ with the la error band between 4.2 x 10~^^^ to 5.8 x 10~^^^. For 
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Figure 8: The probability distribution over log (pa) computed from the Causal Entropic 
Principle. The two curves shown arise from two different models of the star formation rate 
(see Figs. ^ and Their differences hint at the systematic uncertainties in our calculation 
that arise since the history of entropy production is not known to arbitrary precision. These 
uncertainties are apparently irrelevant to our main conclusion: either way, the observed value 
of PA (vertical bar) is not unlikely. 



the Hopkins et al. SFR, we find the median value pA = 2.1 x 10 with the la error 
band between 2.4 x 10-^^4 ^ 4 ^ io-i22_ 

There are "systematic uncertainties" in our calculation of the probability distribu- 
tion, which come from our lack of knowledge of the precise history of entropy production 
in our universe. The comparison between the two models for the SFR gives a good 
sense of their size. It shows that these uncertainties do not affect our conclusion: with 
either choice, the observed value is well within the la region, and hence, not unlikely. 
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4. Discussion 



4.1 AS captures complexity 

Our main quantitative result is the probability distribution for p\. However, we have 
also discovered an important intermediate result: in our own causal diamond, the 
dominant contribution to entropy production comes from the infrared glow of dust 
heated by starlight. 

This is remarkable. It shows that a seemingly primitive quantity, AS, captures 
many of the conditions that are usually demanded explicitly by anthropic arguments. 
AS would drop sharply if galaxies, stars, or heavy elements were absent. Accord- 
ing to the Causal Entropic Principle, the weight carried by such a vacuum would be 
suppressed. 

For example, consider a universe like ours, except without heavy elements. (This 
could be arranged by adjustments in the standard model.) Galaxies would still form, 
and stars would burn, but there would be no dust available to convert optical photons 
into a much larger number of infrared photons [43]. The Causal Entropic Principle 
assigns a weight 100 times larger to our vacuum than to this one — simply based on 
the entropy production, without knowing anything about the potential advantages of 
heavy elements often claimed in anthropic arguments. 

This demonstrates that A^* can be an effective and very simple substitute for a 
number of dubious anthropic conditions. More importantly, our result lends credibility 
to AS" as a weighting factor for vacua with very different low-energy physics. Estimating 
the number of observers in such vacua, even averaged over a large parameter space, 
appears wholly intractable, but estimating AS" may not be. 

Thus, the Causal Entropic Principle may allow us, for the first time, to predict 
probability distributions over the entire landscape, rather than just over subspaces 
constrained to coincide with much of our low-energy physics. As discussed in Sec. |1|, 
this could lead to a breakthrough on extracting predictions directly from the underlying 
theory (the string landscape), without conditioning on parameters specific to our own 
vacuum. 

4.2 Understanding our distribution 

Now, let us turn to our main result. In our approach, the most likely range of logpA is 
set not by the time of galaxy formation, but by the time at which the rate of entropy 

^°It is amusing to note that this class of entropy producers includes the authors and the reader, in 
the sense that the Earth, like dust, is made of heavy elements and also absorbs starlight and re-emits 
in the infrared. 
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production, per unit time and unit comoving volume, is largest. This can be understood 
as follows. 

Consider, for the sake of argument, an entropy production rate that is independent 
of time and of p\. Then the total entropy AS produced within the causal diamond is 
proportional to j Vc{t) dt, where Vc is the comoving volume (or equivalently, the mass) 
present inside the causal diamond at the time t. This integral is the area under the 
curves shown in the lower panel of Fig. ^. 

At small times (near the bottom tip) the causal diamond is small and Vc is neg- 
ligible. After vacuum domination, at a time of order = (3/87rpA)^^^, the top cone, 
which contains one de Sitter horizon region, quickly empties out and M{t) vanishes ex- 
ponentially. Thus, only the era around the time of matter/vacuum equality contributes 
significantly to the integral. 

Up to a pA-independent factor of order unity, the above integral is therefore the 
product of t\M{t\). But M{tjC) is just the total mass inside the horizon around the 
time when A begins to dominate. This is again of order ^a and thus proportional to 
. Hence the total entropy produced, AS', scales like 1/pa in our hypothetical case 
of a constant entropy production rate. This is also clear by inspecting the area under 
the different curves in Fig. ^. 

Assuming a flat prior {dp/dp/^ = const, or equivalently, dp/d\ogp\ oc pa), the 
observer-weighted probability distribution is 

dP 

oc w{pa)pa , (4.1) 



d log Pa 

and the Causal Entropic Principle states that the weight is 

w = AS. (4.2) 

For the hypothetical, constant entropy production rate, we have w oc p^^, and 
hence 

— = const . (4.3) 

d log Pa 

The weight AS* in this case takes a prior distribution that was flat in pA into an 
observer-weighted distribution that is fiat in logpA, showing no preference between, 
say. Pa = IQ-^^i ^^d pa = IQ-^^s^ 

In the prior distribution, there are more vacua at large pa, so exponentially small 
values of pA are very unlikely. The above example shows that the Causal Entropic 
Principle captures an important compensating factor: vacua with smaller pA give rise 
to a larger causal diamond, i.e., to a bigger de Sitter horizon and a longer time until 
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vacuum domination. This allows for greater complexity and compensates for the rarity 
of such vacua. 

Next, let us consider the time-dependent entropy production rate we found in 



Sec. |3.3| . We found that the entropy production due to stars has a fairly broad peak 
around tpeak ~ 2 to 3.5 Gyr after the big bang. At earlier times, it is lower because 
fewer stars have formed; at late times, it is lower because few new stars form while 
older ones have burned out. 

Because of the time dependence, paAS will no longer be constant. For large values 
of Pa, the causal diamond is small, and it will contain only a small comoving volume 
by the time the entropy production peaks [tpeak ^ 4dge = 0.23)f:A; see Eq. In 



this regime, dP/ d\ogp\ will decrease with p\. For small pA, the causal diamond is very 
large, but the entropy production rate peaks early, when the comoving volume is still 
relatively small (tpeak ^ 4dge)- In this regime, dP/dlogp^ will increase with p\. This 
is illustrated in Fig. ^. 

Therefore, dP/ d log pA will be maximal for values of pA such that 

4dge(PA) = tpeak ■ (4.4) 

By Eqs. ( p.2|) and (|3.11| ), the observed value of logpA should be near 

logpA,peak ~ log(0.006/4,k) ~ "123 . (4.5) 

This rough estimate is borne out by our more careful calculation in Sec. |^. The excellent 
agreement of the observed logpA, Eq. (p,.l[), with this prediction is reflected in Fig. ^, 
where it can be seen that the edge time and the peak time really coincide for our 
universe. 

The width of our distribution can also be understood in this manner. Let ton and 
tofT be the times at which the entropy production rate is at half of its peak rate. Using 
those values in Eq. (|4.4| ) gives roughly the la boundaries we found for our distribution 



in Sec. 3.4. To summarize, the peak and the width of the probability distribution for 



Pa are related to the peak and width of the entropy production rate by Eq. ( [4.4|) . 

Our distribution has a greater width than the distribution obtained from the num- 
ber of observers-per-baryon; this can be seen clearly in Fig. |I]. This is also not hard 
to understand. In the traditional approach, nothing compensates for the exponential 
growth of dP/dlogp\ with logpA, until a fairly sharp cutoff occurs when logpA be- 
comes large enough to disrupt galaxy formation. Hence, the preferred values of log pa 
are squeezed into a narrow interval, and the observed value is strongly excluded. In 
our approach, the spacetime volume of the causal diamond depends inversely on pA, 
cancelling the pressure towards large values of pA- The width of the probability curve 
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Reheating Surface 



Figure 9: This cartoon demonstrates how the Causal Entropic Principle leads to a pre- 
ferred value of the cosmological constant. The horizontal band represents the rate of entropy 
production; darker areas correspond to a higher rate. Vacua are weighted by AS*, the total 
amount of "darkness" inside a causal diamond. Vacua with large cosmological constant are 
plentiful in the landscape, but they lead to small causal diamonds, which capture virtually 
no entropy production (right). For some smaller value, the diamond will be just large enough 
to capture the bulk of the entropy production (center). This is the preferred cosmological 
constant. Larger diamonds may capture slightly more AS* (left), but not in proportion to 
their size. They correspond to vacua with very small cosmological constant which are much 
rarer in the landscape. Therefore they will be suppressed. 

is set only by the shape of the peak of the entropy production rate (Fig. 6), which is 
fairly wide. 

In this discussion we have pretended that p\ does not affect the entropy production 
rate. In fact, this is an excellent approximation. In the vicinity the observed value of pa, 
the total entropy production depends on pA niainly through the geometry of the causal 
diamond. The probability density decreases away from this maximum. As a result, 
values of pA large enough to disrupt galaxy formation are highly suppressed even before 
we take into account the suppression of the entropy production rate resulting from this 
disruption. 

This points at another crucial difference between weighting by entropy production 
in the causal diamond, and weighting by observers-per-baryon: the preferred pa is set by 
completely different physical processes, and hence, by essentially unrelated timescales. 
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In the latter approach, one assumes that observers require galaxies. Then the disruption 
of galaxy formation cuts off the exponential growth of dP/d\ogp\. As a result, the 
preferred log p\ is set by the time when galaxies first form, and this gives a value that 
is too large compared to Eq. ( |1.1| ). 

In our approach, we do not assume that observers require galaxies. The size of the 
causal diamond depends inversely on pA, allowing the preferred range of values for log pa 
to be set by the time-dependence of the entropy production rate. The time of peak 
entropy production by dust heated by starlight picks out the value logpA ~ —123. The 
time-scale when galaxies form does not enter directly. In our universe, the difference 
between the two timescales amounts to "only" 3 orders of magnitude in the preferred 
Pa, but it is easy to imagine other vacua in the landscape where the era of peak 
entropy production is parametrically separated from the era when galaxy halos become 
nonlinear (for example, by a large galaxy cooling time). 

4.3 Statistical interpretation 

It is worth emphasizing that it is entirely irrelevant whether the observed pa is, say, 
0.5(7 above or 0.6a below the median of our distribution. We get to make only one 
measurement. There is no reason to expect this one data point to be on the median 
(or on the peak) of the probability distribution. But we can expect that it will not be 
a very unlikely value. Any value in the la region certainly qualifies as not unlikely. 
The success of the Causal Entropic Principle, its formal advantages aside, is not that 
it predicts the precise value of pa, but that our distribution shows that the observed 
value was not unlikely to have been observed. 

Physicists have a great degree of confidence in certain theories that make only 
statistical predictions, even though we are unable to make more than a finite number 
of measurements, let alone test all the consequences of a theory. In this spirit, our 
result improves our confidence in the Causal Entropic Principle and the underlying 
landscape. To improve our confidence further, we cannot repeat the measurement of 
the cosmological constant, but we can extract other predictions or postdictions and 
compare those to observation. 
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A. The radiation era 

In this Appendix, we justify our neglect of the radiation era. The metric, conformal 
time, and density during this era are 

arad(t) = C{t - to)'/' , (A.l) 

n^^it) = 2c-\t - toY/\ (A.2) 

'-'^'^ = 32.{t-toy ■ ^^-'^ 

The constants c and to ^^re determined by matching the Hubble constant and the scale 
factor to the metric Eq. ( p.3| ), which becomes 

2/3 

a(t) = ^-j (A.4) 

for t ^ t\. They must agree at the time teq, when the matter and radiation densities 
are equal, i.e., when [4] 

Prad = Pcq = 0.0026^" = 3.1 X IQ-'^'^ , (A.5) 

where 

^^3.3x 10-28 (A.6) 
is the observed mass of pressureless matter per photon. This yields 

t„ 1 / > X 



By Eq. (|3.9| ), the size of the causal diamond is set by the total conformal time 



duration of the universe since reheating, which is finite. In Sec. 3.1, we neglected the 
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radiation era and extended the matter/vacuum solution all the way back to the big 
bang {t = 0). This yielded a total conformal time 



from Eqs. and Q. 

In order to correct for the presence of the radiation era, we should subtract the 
conformal time interval At' corresponding to the era < t < tcq that should be excised 
from the matter/vacuum solution. It should be replaced by the conformal time interval 
At" corresponding to the radiation dominated era (the portion of the metric ( [A.l|) 
between reheating and matter domination). 

Using the above results, however, it is easy to show that 



Thus, the corrections to the conformal time, and thus to the size of the causal diamond, 
are negligible for pA < Peq- For example, with the observed value of p^, the correction is 
less than 1%. The probability of values of pa > 10^^^° almost vanishes according to our 
calculation; yet this is still 7 orders of magnitude below peq. Hence, our approximation 
is good in the entire range of pa in which our probabihty distribution has support. 
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